JENSEN'S TRACE INEQUALITY 



IN SEVERAL VARIABLES 

Frank Hansen & Gert K. Pedersen 
4 th March, 2003 

Abstract. For a convex, real function / we present a simple proof of the formula 

iK/(E3r=i«fc*fc«fc)) < ^(£r =1 «fc/(*fcK)> 

valid for each tuple (xi, . . . ,x m ) of symmetric matrices in M n and every unital col- 
umn (ai, . . . , a m ) of matrices, i.e. £feLi a fe a fe = !• This is the standard Jensen trace 
inequality. If / > it holds also for the unbounded trace on B(55), where £) is an 
infinite-dimensional Hilbert space. We then investigate the more general case where 
t is a densely defined, lower semi-continuous trace on a C* — algebra A and / is a 
convex, continuous function of n variables, and show that we have the inequality 

r (/(£r=l K*k*k)) < r (£r=i a* k f{x k )a k ) 
for every family of abelian n— tuples x k = (xik, . . . ,x n fe), i.e. tuples of self-adjoint 
elements in A such that [a^fc, Xjk] = for all i,j and k, where 1 < k < m, and 
every unital m— column (ai,...,a m ) in M(A), provided that the elements yi = 
"Yl^—i a k Xik a k a l so form an abelian n— tuple. We even establish this result for weak* 
measurable, self-adjoint, abelian fields (xa)t<=T, 1 < i < n, i.e. [xu, Xjt] = for all 
i,j and t, and a weak* measurable, unital column field (at)teT in M(A) paired with 
any trace or trace-like functional tp, i.e. one that contains the n— tuple (presumed 
abelian) with elements yi = J T a^xuat dfi(t) in its centralizer. This takes the form 
of the inequality 

We also study functions of n variables that are monotone increasing in each variable, 
and show in two important cases that tp(f(x)) < y(/(y)) whenever x = (x\, . . . , x n ) 
and y = (yi , . . . , y n ) are abelian n— tuples with Xi < yi for each i and <p is a trace or 
a trace-like functional. 



1. Introduction. Several important concepts in operator theory, in quantum 
statistical mechanics (the entropy, the relative entropy and Gibbs' free energy), in 
electrical engineering and in mathematical economics involve the trace of a function 
of a self-adjoint operator. This has motivated a considerable amount of abstract 
research about such functions in the last fifty years. An important subset of ques- 
tions concern the convexity of trace functions with respect to their argument, and 
the generalizations of this known as Jensen trace inequalities. 
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The convexity of the function x — > Tr(f(x)), when / is a convex function of 
one variable and x is a self-adjoint operator, was known to von Neumann, cf. [21, 
V.3. p. 390]. An early proof for f(x) = exp(x) can be found in [29, 2.5.2]. A 
proof found by E.H. Lieb in the early seventies describes the number Tr(f(x)), 
where / is convex, as a supremum (taken over all possible choices of orthonormal 
bases of the Hilbert space) of the sum of the values of / at the diagonal elements 
of the matrix for x. Obviously, then, this is a convex function of x. The proof was 
communicated to B. Simon, who used the method to give an alternative proof of the 
second Berezin-Lieb inequality in [30, Theorem 2.4], see also [31, Lemma II. 10.4]. 
Simon only considers the exponential function, but the argument is valid for any 
convex function, cf. [17, Proposition 3.1]. The general case for an arbitrary normal 
trace on a von Neumann algebra was established by D. Petz in [28, Theorem 4], 
using the theory of spectral dominance (spectral scale). 

When a convex combination X)fc=i ^k%k of matrices (or operators) with coef- 
ficients (Ai,..., A m ) is replaced by the non-commutative version YlT=i a k x k a k, 
where (a\, . . . ,a m ) is a unital m— column, i.e. an m— tuple of matrices (or oper- 
ators) such that Y^k=i a k a k = 1? we obtain a generalization known as Jensen's 
operator inequality. For an operator convex function, i.e. a function / such that 
f(Xx + (1 — X)y) < Xf(x) + (1 — A)/(y) for any pair of self-adjoint matrices x and y 
(of arbitrary high order), this result was found by the first author in [6], and used 
by the two of us in [9] to give a concise review of Lowner's and Bendat-Sherman's 
theory of operator monotone and operator convex functions. With hindsight we 
must admit that we unfortunately chose the contractive form f(a*xa) < a* f(x)a 
for a*a < 1, this being the seemingly most attractive version at the time. However, 
this necessitated the further conditions that G / and f(0) < 0, conditions that 
have haunted the theory since then, and which become a real obstacle when we pass 
to several variables. The Jensen inequality for a trace on a von Neumann algebra 
and an arbitrary convex function / was found by Brown and Kosaki in [5], still 
in the contractive version. Elementary proofs of these results can now be found in 
[11]. 

We begin the paper with the simple proof of the full Jensen trace inequality 
for matrices taken from [11], which uses ideas from Lieb's proof mentioned above. 
Although this result follows from the more general theorem later on in the paper we 
feel that an elementary proof of the most applicable version would be a convenience 
for the (not too specialized) reader. Also, the simple proof contains all the basic 
ideas in the more elaborate versions and thus makes it easier to grasp these. 

2. Theorem. If f: I — > K. is a convex function defined on an interval I the 
inequality 

Tr ( / j < (jt a tf( x k)a^J (1) 

holds for each m-tuple of self-adjoint nxn matrices (x±, . . . , x m ) with spectra in I, 
every unital m-tuple (ai, . . . , a m ) of n x n matrices and all natural numbers n. 

Proof. Let Xk = Y1 sp (x fe ) ^fc(^) denote the spectral resolution of Xk for 1 < k < m. 
Thus, Ek(\) is the spectral projection of xt on the eigenspace corresponding to A if 
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the (atomic) probability measure 

/ m 



yk=l 



k=l 



for any (Borel) set S in R. Note now that if y = X)fcli a* k Xkdk then 

/ m 




(3) 

K k=l sp(x fc ) 

If a unit vector £ is an eigenvector for y then the corresponding eigenvalue is (y£|£) 
and £ is also an eigenvector for /(y) with correponding eigenvalue (/(y)£|£) = 
/((yClC))- I n this case we therefore have 



e = (mm) = mm)) 



= f(J Xd^(X)j < J f(X)d^(X) (4) 

ml \ m 

= E E /(A)^ fe (A)a fc £ a fc £ = X>;/(a*)a fc £ | fl , 

fc=i y sp (x fe ) / fc=i 

where we used (3) and the convexity of / - in form of the usual Jensen inequality 
- to get the inequality in (4). 

The result in (1) now follows by summing over an ortho normal basis of eigen- 
vectors for y. □ 

3. Spectral Theory in Several Variables. The really new problems start 
when we consider a function /(A) of n real variables (with A = (Ai, A2, • • • , A n )). 
Naturally, we wish to replace the real variables Xj by self-adjoint operators Xj as in 
the one- variable case. An immediate problem that now arises is how to define f(x) 
in this case. The spectral theorem which was used in the one- variable case fails here 
unless the Xj's commute with one another. This means that the largest domain of 
definition for / is the set of abelian n— tuples in B(^), i.e. tuples x = (x\, . . . ,x n ) 
of self-adjoint elements such that [2^, xj] = for all i and j. 

For functions of two variables the spectral theory of abelian tuples (pairs) is 
equal to spectral theory for normal, instead of self-adjoint operators. [As long as 
we consider only continous and not differentiable functions, a complex function is 
just a function of two real variables !] This theory is markedly more difficult than 
the one variable case, in particular because the set of normal operators has no linear 
structure. 

To be more specific, consider a C*— algebra A of operators on some Hilbert space 
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with spectra contained in i". If £ = Ii x • • • x I n C M n and / is a continuous function 
on £ we can for each abelian n— tuple x = {x±, • • • , x n } in A^ define an element 
f(x) in A. To see this, let X{ = J XdEi(X) be the spectral resolution of Xi for 
1 < i < n. Since the x^s commute, so do their spectral measures. We can therefore 
define the product spectral measure E on £ by E(S\ x • • • x S n ) = Ei(S\) ■ ■ ■ E n (S n ), 
and then write 

f(x) = J /(A) dE(X) = J f(X u • • • , A n ) dE{X u ■ ■ • , A n ). (5) 

Of course, if / is a polynomial in the variables Ai, . . . , A n we simply find f(x) by 
replacing each Ai with Xi. The map / — > f(x) so obtained is a *— homomorphism of 
C(£) into A and generalizes the ordinary spectral mapping theory for a single (self- 
adjoint) operator. In particular, the support of the map (the smallest closed set S 
such that f(x) = for every function / that vanishes on S) may be regarded as the 
"joint spectrum" of the elements X\, . . . , x n . In Gelfand language the commutative 
unital C*— subalgebra generated by the x^s is *— isomorphic to C(S). 

4. Convexity in Several Variables. The set of abelian n— tuples in M(Sj) is 
obviously not a convex set, so at first glance it makes little sense to discuss convexity 
properties of the operator function x f(x). We shall therefore consider abelian 
tuples x and y that are compatible, which by definition means that the line segment 
between them also consists of abelian tuples. It is easily seen that this happens 
precisely when 

[xi, yj] = [xj, yi] for all % and j. (6) 

Now we can meaningfully ask whether Tr(f(Xx+(l — X)y) < Tr(Xf(x) + (l — X)f(y)) 
when / is a convex function. 

Note from (6) that if {xi, ■ ■ ■ ,x m } is a set of pairwise compatible, abelian 
n— tuples, then any linear combination YlT=i ^kX k is again an abelian n— tuple 
compatible with all the x^s, so that the set coiw{x 1 ,x 2 ■ ■ ■ i s a convex do- 

main for the operator function /. This also means that any set S a of pairwise 
compatible, abelian n— tuples in a C*— subalgebra A of M(Sj) is contained in a 
maximal set <S, which by necessity must be a closed, linear subspace of A™ & - One 
may wonder how such maximal sets look like, and a few experiments show that the 
variety is wide. Let C be a commutative C* —subalgebra of A such that C" = C 
(where ' denotes relative commutant). For example, C could be the the center of 
A (in which case C = A), or it could be any maximal abelian C*— subalgebra of 
A (in which case C = C). Note though, that the condition C" = C means that C 
always contains the center of A. Now fix a non-zero vector (ei, . . . ,e n ) in R n and 
define 

S = {x = {e x x + ci, . . . , e n x + c n ) \ x e C' sa , a e C sa } . (7) 

Then it is easy to check that S is a maximal set of pairwise compatible, abelian 
n— tuples in A. 

The more useful examples occur, however, at the other extreme of the situation 
above. We assume that the C*— algebra A comes equipped with a set of pairwise 
commuting C*— subalgebras Ai, . . . , A n . Then the subspace 



n 

^J)(^i)sa = {x = (xi, . . . , X n ) | Xi G (-Ai)sa} 



(8) 
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consists of pairwise compatible, abelian n— tuples; and under the mild extra condi- 
tion that each A% equals the relative commutant in A of the C* —algebra generated 
by the Aj : s for j ^ i, (i.e. (Uj^Aj)' = A%) the space is also maximal. This con- 
dition may be achieved by replacing in turn each of the algebras Ai by (Uj/i A?)'- 
This frame applies readily to the seminal situation where A = Ai <S> • • • <S> A n in 
B(fj). Indeed, most authors that have considered operator functions of several vari- 
ables have followed Koranyi's lead and used the functions only on tensor products, 
cf. [14]. 

In the setting of compatible, abelian tuples we are going to replace the trace 
Tr on the Hilbert space by a densely defined, lower semi-continuous trace r on an 
abstract C*— algebra A] i.e. a functional defined on the set A+ of positive elements 
with values in [0, oo], such that r(x*x) = t(xx*) for all x in A. Thus we shall 
consider the function x — > r(f(x)) on a set of compatible, abelian n— tuples in Asa- 
Some of our results have appeared in more primitive versions before. The tracial 
convexity of the function x — > f(x) on the space of n— tuples in ©" =1 (^4i)sa with 
values in ®" =1 Ai was proved by the first author for matrix algebras in [8]. His 
result was extended to general operator algebras and traces by the second author in 
[27]. Both proofs rely on Frechet differentiability and somewhat intricate manipu- 
lations with first and second order differentials. It was then realized by Lieb that 
his proof, mentioned above, could be extended to the case of several variables with 
only marginal changes, and the improved version appeared in [18]. The present 
version generalizes and subsumes the previous papers. In particular we show that 
the function x — > r(/(x)) is convex on any set of the form conv{x l7 . . . , x m }, where 
the x fc 's are pairwise compatible, abelian n— tuples in A sa . 

5. Measurable Fields of Operators. Let A be a (separable) C* — algebra of 
operators on some (separable) Hilbert space S) and T a locally compact metric 
space equipped with a Radon measure \x. We say that a field (at)teT of operators 
in the multiplier algebra M(A) of A, i.e. the C*— algebra of elements a in B(j}) such 
that xA + Ax C A, is weak* measurable if each function t <p(at), where cp G A*, 
is /i— measurable. It is worth noticing that (at)teT is weak* measurable if (and only 
if) for each vector £ in fj the function t — > a t £ is weakly (equivalently strongly) 
measurable (because the set of linear combinations of vector functionals is weak* 
dense in A*). It follows that if both (at)teT and (btjteT are weak* measurable 
fields then also (atbt)teT is a measurable field. 

If the function t — > <p(at) is integrable for all states <p and f T \cp(at) \ d(i(t) < 7 
for some constant 7, in particular if the function t — > \\at\\ is integrable, there is a 
unique element in M(A), designated by f T at dfx(t), such that 

<p(^J a t dii(t)^ = J <p(a t )d^(t) if e A\ (10) 

cf. [26, 2.5.15]. We say in this case that the field (at)teT is integrable. If all the 
at's belong to A then also f T a t dfx(t) belongs to A. If the weak* measurable field 
(a^a t )teT is integrable with integral 1 we say that (a t )teT is a unital column field. 

6. Final Notations. Consider now an n— tuple of weak* measurable, bounded 
fields (xit)teT, each consisting of self-adjoint elements in A with spectra in some 
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x t = (xu, . . . , x n t) is an abelian n— tuple. Furthermore, consider a unital column 
field (a t )teT in M(A), i.e. J T a* t a t dji{t) = 1. Assume finally that the elements 
Hi = j T a* t Xit(it d(i(t) in A form an abelian n— tuple. 

The commutation condition above for the y^s depends on intricate relations 
between the two measurable fields (xi t )teT and (a t )teT- It is, however, satisfied if 
the fields satisfy the following extension of the commutativity condition in (5): 

[citXitCit, alxj s a s ] = [alxj t a t , a*Xi s a s ] for all s and t. (11) 

Thus in particular if Xi t a t a*Xj s = Xj t a t a*Xi s for all s and t. 

For ease of notation we shall write I = I± x • • • x I n and A = (Ai, . . . , A n ) if 
A G I. Moreover, we regard the vector space of n— tuples in A as a bimodule over 
M(A) and write x t = {x\ t , . . . , x nt ) and a\x_ t a t = (a^xi t a t , . . . , a^x nt a t ), so that 
y = (yi,---,y n ) = j T a* t x t a t d^{t). 

We finally recall that the centralizer of a positive functional <p on A is the 
C*— subalgebra A v = {y E A \ Vx G A: ^p(xy) = (p(yx)}. If (p is unbounded, 
but lower semi-continuous on A+ and finite on the minimal dense ideal K(A) of A, 
we define A v = {y G A \ Wx G -K"(.A) : <p(xy) — <p(yx)}. 

7. Theorem. Let (x t )teT be a bounded, weak* measurable field of abelian n— tuples 
in a C* — algebra A, with sp(xn) C Ii for 1 < % < n, and let (a t )teT be a unital 
column field in M(A) such that the elements yi = j T a* t xaat d[i(t) form an abelian 
n— tuple. Then for each continuous, convex function f defined on the cube J = 
7i x • • • x I n in lR n and every positive functional <p that contains the yi 's in its 
centralizer A v , i.e. ^p(xyi) = <fi(yix) for all x in A and every yi, we have the 
inequality: 

V (j Qf a^MM*))) < V (J T a* t f(x t )a t d^ . (12) 

If if is unbounded, but lower semi- continuous on A+ and finite on the minimal 
dense ideal K(A) of A, the result still holds if f > 0, even though the function may 
now attain infinite values. 

Proof. Let C = C (S) denote the commutative C*— subalgebra of A generated by 
yi, . . . , y n , and let be the finite Radon measure on the locally compact, metric 
space S defined, via the Riesz representation theorem, by 

/ y(s) d^(s) = <p(y) yeC = C (S). (13) 
Js 

Since for all (x, y) in M(A) + x C + we have <p(xy) = (p(y 1 / 2 xy 1 ^ 2 ) it follows that 

0<<p(xy)<\\x\\<p(y). (14) 

Consequently the functional y — > <p(xy) on C defines a Radon measure on S domi- 
nated by a multible of fi^, hence determined by a unique element $(x) in (S). 
By linearization this defines a conditional expectation $:M(A) — > (S) (i.e. a 
positive, unital module map) such that 



/ y(s)$(x)(s) duJs) = ip(yx), yeC x G M(A). (15) 
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Inherent in this formulation is the fact that if y G C = C (S), then $>(y) is the 
natural image of y in (S). In particular, y(s) = $>(y)(s) for almost all s in S. 

Observe now that since the C*— algebra C (I_) is separable we can for almost 
every s in S define a Radon measure fx s on £ by 

j^{\)d^ s {\) = $ ^a*g(x t )a t d^ (s) g G C (f). (16) 

As f T a\at dfj,(t) = 1 this is actually a probability measure. 
If we put <7i(A) = Aj for 1 < i < n then 

J^gi{\)dn s {\) =§^aixita t d f i(t) y j (s) = $(yi)(s) = yi (s). (17) 

Since yi G C for all i we get by (17) - using the convexity of / in form of the 
standard Jensen inequality - that 

f(y)(s) =f(y(s)) = f(yi(s),...,y n (s)) 

< ^/G/i(A),...,</»(A)) <WA) (18) 



(19) 



Integrating over s now gives the desired result: 

<p(f(v)) = [ f(y)(s)d^(s) 

< Q T a* t f(x t )a t d/j(t^J (s)d^(s) 

= ^(a* t f(x t )a t )(s)dfi (p (s)dfi(t) 

JT JS 

= J T ^(a*f(x t )a t ) d/j,(t) = <p (^J^a*f(x t )a t dii(t)^ . 

Having proved the finite case, let us now assume that <p is unbounded, but lower 
semi-continuous on A+ and finite on the minimal dense ideal K(A) of A . Such 
functionals were termed C* — integrals in [23] and [24]. This - by definition - 
means that <p(x) < oo if x G A+ and x = xe for some e in A+, because K(A) 
is the hereditary *— subalgebra of A generated by such elements, cf. [25, 5.6.1]. 
Restricting <p to C we therefore obtain a unique Radon measure fx v on S such that 

/ y(t)d^(t)=<p(y) yeC. (20) 

Inspection of the proof above now shows that the Jensen trace inequality still holds 
if only / > 0, even though oo may now occur in the inequality. □ 
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8. Remarks. The second condition in Theorem 7, that the elements j/j are mu- 
tually commuting, is not easy to verify. There are, however, a few cases that can 
be handled with ease. In the first we simply set n = 1, so that we obtain the one- 
variable extension of Theorem 2. This is done in Corollary 9. In the second case we 
let each at be a positive scalar and set a^at = A(£). Then if [xu, xj s ] = [xjt, X{ s ] for 
all i,j, s and t (so, in particular [xu, Xjt] = 0), the elements yi = j T xu\(t) dfi(t) 
will form a abelian n— tuple. Thus in Corollary 11 we obtain an extremely strong 
version of the convexity of the trace function, proved in weaker forms in [8, 27, 
18]. 

9. Corollary. For each convex, continuous function f on an interval I, every 
bounded, weak* measurable field (x t )teT an ^ ever U united column field (a t )teT 
in M(A) we have the inequality 

ip ( / y^a* t x t a t dfi(t)^ < ip a t 7(x t )M//(*)) (21) 

for every positive functional <p that contains the element y = f T a\xtat dfi(t) in its 
centralizer. 

If ip is unbounded, but lower semi-continous and finite on the minimal dense 
ideal K(A) of A, the result still holds if f > 0. □ 

For continuous fields this result was proved in [11, Theorem 4.1]. 

10. Corollary. For each convex, continuous function f on a cube £ = I\ x • • • I n 
in M, n , each probability measure [i on a locally compact, metric space T and every 
n— tuple of bounded, weak* measurable fields (xi t )teT, where xa E A^ for 1 < i < 
n, such that [xn, Xj s ] = [xj t , Xi s ] for all i,j, s and t we have 



f^x t d^t)^ <^^(/fe))^)), (22) 



for every positive functional <p on A that contains the elements yi = f T Xi t d/i(t) in 
its centralizer. □ 



Specializing to convex combinations (discrete probability measures) and traces 
we obtain the following version of Corollary 10: 

11. Corollary. For each convex, continuous function f on a cube £ = I\ x • • ■ I n 
in M. n , and every trace r on a C* — algebra A the function 

x r(f(x)) (23) 

is convex on the set of compatible pairs of abelian n— tuples x = (xi, . . . , x n ) in A 

with sp(xi) G Ii for all i. □ 



The condition in Theorem 7 that the elements xa and Xjt commute mutually is 
also rather awkward. An easy and important solution to this problem is to assume 
from the outset that the C* —algebra A comes equipped with mutually commuting 
C* — subalgebras Ai, . . . , A n and then require that xu G Ai for all i and t. Now the 
domain of definition of / is the convex set ®" =1 (^4i)fi and we can state the Jensen 
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12. Corollary. For each convex, continuous function f on a cube I = I± x • • • x I n 

in M, n and every C* — algebra A with mutually commuting C* — subalgebras Ai, . . . A n 
we have the inequality 

V? (f Qf a^a^f))) < (p ^J^ t f{x t )a t d^ (24) 

for each bounded, weak* measurable field (x t )t&T = ((^it)teT, • • • , (x n t)teT) in 
©r=i(^)sL an d ever y unital column field (a t )teT in M(A), provided that the ele- 
ments yi = f T a\xnat d(x(t) form an abelian n— tuple and the functional <p contains 
these elements in its centralizer. 

If ip is unbounded, but lower semi-continous and finite on the minimal dense 
ideal K(A) of A, the result still holds if f > 0. □ 

This result generalizes both [11, Theorem 4.1] and [27, Theorem 2]. 

In the next case let the parameter space be N n x T, where N n denotes the finite 
subset {1,2,..., n}. So instead of the index t we now have the double index (j, t). 
We then assume that Xijt = xu if j = i and that Xijt = if j ^ i. Furthermore we 
assume that ajt G M(Aj) for all j, so that we have the elements bj = j T a* t ajt dixit) 
in M(Aj) with Y^=i bj = 1- Note now that 

Vi = ^2 a* t x ijt a jt dn{t) = / a* t x it a it d/j,(t) G A, (25) 
j= i j t Jt 

so the commutativity condition is trivially satisfied. Consequently we have the 
following result: 

13. Corollary. For each convex, continuous function f on a cube I = I\ x • • • x 
I n , where G Ii for each i, every n— tuple of bounded, weak* measurable fields 
(xu)teT C (A)sa an d every n— tuple of integrable column fields [aitjteT in M(Ai) 
with J27=i It a it a it dfi(t) = 1 we have the inequality: 



^(/(Z a it Xltaitd ^> • • •' J Kt x nta n tdfx(t) 

/ n T T \ (26) 

< <P J a itf(°' • • • ' X «> • • • ' °) a « d n(t)\ 

for every positive functional p> that contains the elements yi = f T a* t Xi t ai t d/i(t) in 
its centralizer. □ 



In the last case we again use the parameter space N„xT and take a jt in M(Aj) 
for all j, but we now put x^t = Xi constantly for some fixed Xi in (A)sa- Then 

Vi = ^2 a*jt x i a jtd^(t) = / a* t Xia it dii(t) + (1 - b^Xi G A, (27) 
■ =1 Jt Jt 
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14. Corollary. For each convex, continuous function f on a cube I, every n— tuple 
x with elements Xi in (Ai)li and every n— tuple of integrable column fields (aa)teT 
in M(Ai), with Ym=\ ^ = 1> where bi = f T a* t ai t d/j,(t) , we have the inequality: 

<f(f (^J aj t xiaitd//(t) + (l-6i)xi,...,y a* nt x n a nt d/j,(t) + (1 - b n )x n ^ 

< <P j T a itf( x ii • • • > x n)a it d/i(t)j (28) 

for every positive functional <p that contains the elements yi = f T a* t Xiai t d/i(t) + 
(1 — bi)Xi in its centralizer. □ 

15. Monotonicity. We conclude the paper with some results about monotonicity 
of operator functions under a trace or a trace-like functional. The tendency is that if 
/ is monotone increasing in each variable and x = (x\, . . . , x n ) and y = (y\, . . . , y n ) 
are abelian tuples, so that we have a chance to define f(x) and /(y), then ip(f(x)) < 
ip(f(y)) if only Xi < yi for all i. This result may or may not be true in general. We 
can prove it when / is either convex or concave, or when x and y are compatible. 

16. Theorem. Let f: I — > K. be a continuous function on a cube I = [ai, Pi] x • • • x 
[a n ,(3 n ] in W 1 , and assume that f is monotone increasing in each variable. Then 
for any two abelian n— tuples (xi, . . . , x n ) and (yi, . . . , y n ) of self-adjoint elements 
in a C* — algebra A with a^l < xi < yi < fyl for all i we have the inequality 

< fifiv)) (29) 

for any positive functional tp on A that contains the elements x\,...,x n in its 
centralizer, provided that f is also convex. If instead f is concave the result holds 
if the elements yi, ■ ■ ■ ,y n belong to the centralizer of <p. 

Proof. Let C = C (S) denote the commutative C* — subalgebra of A generated by 
the Xis. As in the proof of Theorem 7 we then obtain a Radon measure on £ 
and a conditional expectation $:M(^4) — * (S) such that 



z(s)$(y)(s)dfiv(s) = (p(zy), zeC y G M(A), 

Js 



(30) 



where <&(z)(s) = z(s) almost everywhere on S for each z in C. 

Since £ is separable we can for almost every s in £ define a probability measure 
H s on / by the formula 

^(A)^ S (A) = ®(g(y))(s) g e C{I_). (31) 
If we set gi(X) = Xi for each i, this means that 



/ 9i(X)d^ s (X) = $( 9 i(y))(s) = ®( yi )(s). (32) 
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Assume now that / - in addition to being monotone increasing - is also convex 
on the cube I. Then, using that f(x) £ C it follows that 

*(/G»D)00 = /(*)(*) = f(Ms), x n (a)) 
= f . . • , *(!„)(«)) < / (*(yi)(a), • • • , *(y»)(a)) 



(33) 



= / ^0i(A) d^ s (X),..., J^g n (X) d/i s (X) 
< J f (<7i(A), ■ ■ ■ , <7n(A)) c^ s (A) = ^ /(A l7 . . . , A n ) e^ s (A) 

where we used the monotonicity of / to obtain the first inequality sign in (33) and 
the convexity of / - in form of the usual Jensen inequality - to obtain the second 
inequality sign. Integrating over s now yields the desired result: 



<p(f(x))= / $(/(*))(*) <WM 

Js (34) 

< / ®(f(y))(s)d^(s) = <p(f(y)). 
Js 

If on the other hand we assume that / is a concave function we simply permute 
the roles of the n— tuples x and y and let C denote the C*— subalgebra of A generated 
by the y^s. The conditional expectation $>:M(A) (S) now satisfies that 

<fr(j/j)(s) = yi(s) almost everywhere. Similarly we redefine the probability measures 
H s by the new formula fjg(X) dfx s (X) = $(g(cc))(s), so that now 

J gi (X)dfi a (X) = ®(gi(x))(s) = ^(xi)(s). (35) 

It follows as in (33) that we have the inequalities 

= J /(Ai, • • • , A„) d// s (A) = J f (0i(A), • • • , 9n{X)) dfi s (X) 
f (V £1 (A) d^ s ( A) , • • • , J^Qn (A) dn s (A) 



< 



(36) 



= / ($(xi)(s), . . • , *(x n )(s)) < f ($(yi)(s), • • • , $(y„)(s)) 
= /( yi ( a ), . . . , y„( a )) = /(y)(a) = 



where we now used the concavity of / to obtain the first inequality in (36). Inte- 
grating over s we again get the desired inequality (29). □ 



12 



FRANK HANSEN & GERT K. PEDERSEN 



17. Remarks. Evidently we may combine the two conditions in Theorem 16 to 
show that if / is an increasing function which admits a decomposition / = /+ + /_, 
where /+ and /_ are both increasing and /+ is convex whereas /_ is concave, 
then <p(f(x)) < <p(f(y)) if x < y, provided that all the elements xi,...,x n and 
yi , . . . , y n belong to the centralizer of <p. However, such a decomposition, even 
approximately, is not possible in general, not even in the one-variable case. The 
reader may check that sin(t), for — tv/2 < t < it/2 can not be approximated by 
any function / = / + + /_, where f + is convex and /_ is concave, and both are 
increasing. In fact, || sin— /||oo > (27r) — 2 . 

The simple function f(s,t) = st is neither convex nor concave, but increases 
in each variable on the first quadrant. One proves by direct calculations that if 
xi,yi and £2,2/2 are positive elements in a C*— algebra A with xi < X2, yi < 2/2 
then r(xiyi) < rix^yo) for every trace r on A. The simple argument relies on the 
cyclicity of the trace, which for two factors is equivalent to commutativity, but does 
not need the commutator equations [xi,yi] = [^2,2/2] = 0, which we are prepared 
to insert to get abelian tuples. This particular argument fails for three factors, so 
that we are not able to decide whether the function /(r, s, t) = rst is an increasing 
trace function on positive abelian triples. 

Despite this setback one may still hope that the function x — > T~(f(x)) is increas- 
ing on the set of abelian n— tuples, provided only that / is monotone increasing in 
each variable; at least when r is a trace or a trace-like functional. Our last result, 
an extension of [27, Corollary 5], shows that this is true when the two abelian 
n— tuples are compatible. The proof uses the Frechet differential as in [10]. 



18. Proposition. Let f be a continuous function on a cube L = [oji,/?i] x • • • x 
[a n ,(3 n ] in M. n , and assume that f is increasing in each variable. Then for any two 
compatible, abelian n— tuples x and y in a C* — algebra A that satisfy oti < Xi < yi < 
Pi for all i, we have the inequality (p(f(x)) < (p(f(y)) for any positive functional ip 
on A that contains all the elements x±, . . . , x n and yi,...,y n in its centralizer. 

Proof Put h = y-x, and let g(t) = <p(f(x+th)) = ip(f((l-t)x+ty)) for < t < 1. 
(Note that this is well defined since x and y are compatible.) Then 

¥>(/(!/))- ¥>(/(*)) = C 9\t)dt, (37) 

Jo 

provided, of course, that g is differentiable. However, working by approximation 
- extending / to a bounded increasing function on M. n and convolving it with a 
suitable approximate unit for L 1 (M n ) like (e/n) n / 2 exp(— ss-s) - we may assume that 
/ is extendable to a Schwartz function on M n , whence f(u) = J R „ exp( i(u-s)) f(s) ds. 
Consequently, with z = x + th, 

g\t) =Hm e- 1 <p(f(z + eh) -f(z)) 

_i r - ( 38 ) 

— lime / <p (exp(i((z + eh) ■ s)) — exp^i^ • s))) f(s) ds. 
By the Dyson expansion of the operator function b — > exp(a + b) we have the ex- 
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this in (38) we get 

g'(t) = (p I i exp(i( ; z • s)r) i(h ■ s) exp(i(z • s)(l — r)) dr J f(s) ds 
JR n \Jq J 

= ¥>(exp(i(z-s)) i(h- s))f(s)ds (39) 

n „ n 

= / V^PiKz- s)) h k) is k f{s)ds = y^tp{f' k {z)hk), 
fc=i fc=i 



where we used that the element z ■ s, hence also exp(i(z ■ s)(l — r)), is in the 
centralizer of (p. Since f' k >0 and h k > for all it follows that > 0, whence 
<p(f(x)) < <p(f(y)) by (37), as desired. □ 
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